Abstract

Under the unifying umbrella of a general result of Penrose & Yukich [Ann. Appl.
Probab., (2003) 13, 277-303] we give laws of large numbers (in the $L^p$ sense) for the
total power-weighted length of several nearest-neighbour type graphs on random
point sets in $\mathbf{R}^d$, $d \in \mathbf{N}$. Some of these results are known; some are new. We
give limiting constants explicitly, where previously they have been evaluated in less
generality or not at all. The graphs we consider include the $k$-nearest neighbours
graph, the Gabriel graph, the minimal directed spanning forest, and the on-line
nearest-neighbour graph.

1 Introduction

Graphs constructed on random point sets in $\mathbf{R}^d$ ($d \in \mathbf{N}$), formed by joining nearby
points according to some deterministic rule, have recently received considerable interest
[20, 29, 31]. Such graphs include the geometric graph, the minimal spanning tree, and (as
studied in this paper) the nearest-neighbour graph and its relatives. Applications include
the modelling of spatial networks, as well as statistical procedures.

The graphs in this paper are based on edges between nearest neighbours, sometimes 
in some restricted sense. A unifying characteristic of these graphs is stabilization: roughly 
speaking, the configuration of edges around any particular vertex is not affected by changes 
to the vertex set outside of some sufficiently large (but finite) ball. Thus these graphs are 
locally determined in some sense. 

A functional of particular interest is the total edge length of the graph, or, more
generally, the total power-weighted edge length (i.e. the sum of the edge lengths each
raised to a given power $\alpha \ge 0$). The large-sample asymptotic theory for power-weighted
length of stabilizing graphs is now well understood; see e.g. [15], [20], [21], [25], [26], [31].

In the present paper we collect several laws of large numbers (LLNs) for total
power-weighted length from the family of nearest-neighbour type graphs, defined on independent
random points on $\mathbf{R}^d$. We present these results as corollaries to a general umbrella theorem
of Penrose & Yukich [26]. Some of the results (for the most common graphs) are known
to various extents in the literature; others are new. We take a unified approach which
highlights the connections between these results.

In particular, all our results are explicit: we give explicit expressions for limiting 
constants. In some cases these constants have been seen previously in the literature. 

Nearest-neighbour graphs and nearest-neighbour distances in $\mathbf{R}^d$ are of interest in
several areas of applied science, including the social sciences, geography and ecology,
where proximity data are often important (see e.g. [16], [27]). Ad-hoc networks, in which
nodes scattered in space are connected according to some geometric rule, are of interest 
with respect to various types of communication networks. Quantities of interest such as 
overall network throughput may be related to power-weighted length. 

In the analysis of multivariate data, in particular via non-parametric statistics,
nearest-neighbour graphs and near-neighbour distances have found many applications, including
goodness of fit tests, classification, regression, noise estimation, density estimation,
dimension identification, cluster analysis, and the two-sample and multi-sample problems;
see for example [7], [13] and references therein.

In this paper we give a new LLN for the total power-weighted length of the on-line 
nearest-neighbour graph (ONG), which is one of the simplest models of network evolution. 
We give a detailed description later. In the ONG on a sequence of points arriving in
$\mathbf{R}^d$, each point after the first is joined by an edge to its nearest predecessor. The ONG
appeared in [1] as a simple model for the evolution of the Internet graph. Figure 1 shows
a sample realization of an ONG.

Recently, graphs with an 'on-line' structure, in which vertices are added one by one 
and connected to existing vertices via some rule, have been the subject of considerable 
study in relation to the modelling of real-world networks. The ONG is one of the simplest 
network evolution models that captures some of the observed characteristics of real-world
networks, such as spatial structure and sequential growth. 

We also consider the minimal directed spanning forest (MDSF). The MDSF is
constructed on a partially ordered point set in $\mathbf{R}^d$ by connecting each point to its nearest
neighbour amongst those points (if any) that precede it in the partial order. If an MDSF
is a tree, it is called a minimal directed spanning tree (MDST).

The MDST was introduced by Bhatt & Roy in [5] as a model for drainage or
communications networks, in $d = 2$, with the 'coordinatewise' partial order $\preccurlyeq^*$, such that
$(x_1, y_1) \preccurlyeq^* (x_2, y_2)$ iff $x_1 \le x_2$ and $y_1 \le y_2$. In this version of the MDSF, each point is
joined by an edge to its nearest neighbour in its 'south-westerly' quadrant. In the present
paper we give new LLNs for the total power-weighted length for a family of MDSFs
indexed by partial orderings on $\mathbf{R}^2$, which include $\preccurlyeq^*$ as a special case. Figure 1 shows an
example of a MDSF under $\preccurlyeq^*$.



Figure 1: Realizations of the ONG (left) and MDSF under $\preccurlyeq^*$ (right), each on 50 simulated
uniform random points in the unit square.

2 Notation and results 

Notions of stabilizing functionals of point sets have recently proved to be a useful basis
for establishing limit theorems for functionals of random point sets in $\mathbf{R}^d$. In particular,
Penrose & Yukich [25, 26] prove general central limit theorems and laws of large numbers
for stabilizing functionals.

The LLNs we give in the present paper are all derived ultimately from Theorem 2.1
of [26], which we restate as Theorem 1 below before we present our results.

In order to describe the result of [26], we need to introduce some notation. Let $d \in \mathbf{N}$.
Let $\|\cdot\|$ be the Euclidean norm on $\mathbf{R}^d$. Write $\mathrm{card}(\mathcal{X})$ for the cardinality of a finite set
$\mathcal{X} \subset \mathbf{R}^d$. For a locally finite point set $\mathcal{X} \subset \mathbf{R}^d$, $a > 0$, and $y \in \mathbf{R}^d$, let $y + a\mathcal{X}$ denote the
set $\{y + ax : x \in \mathcal{X}\}$. Let $B(x; r)$ denote the closed Euclidean ball with centre $x \in \mathbf{R}^d$
and radius $r > 0$. Let $\mathbf{0}$ denote the origin in $\mathbf{R}^d$.

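Let $\xi(x; \mathcal{X})$ be a measurable $[0, \infty)$-valued function defined for all pairs $(x, \mathcal{X})$, where
$\mathcal{X} \subset \mathbf{R}^d$ is finite and $x \in \mathcal{X}$. Assume $\xi$ is translation invariant, that is, for all $y \in \mathbf{R}^d$,
$\xi(y + x; y + \mathcal{X}) = \xi(x; \mathcal{X})$. When $x \notin \mathcal{X}$, we abbreviate the notation $\xi(x; \mathcal{X} \cup \{x\})$ to
$\xi(x; \mathcal{X})$. For our applications, $\xi$ will be homogeneous of order $\alpha \ge 0$, that is
$\xi(rx; r\mathcal{X}) = r^\alpha \xi(x; \mathcal{X})$ for all $r > 0$, all finite point sets $\mathcal{X}$, and all $x \in \mathcal{X}$.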

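For any locally finite point set $\mathcal{X} \subset \mathbf{R}^d$ and any $\ell \in \mathbf{N}$ define

$$\xi^+(\mathcal{X}; \ell) := \sup_{k \in \mathbf{N}} \left( \operatorname{ess\,sup} \left\{ \xi(\mathbf{0}; (\mathcal{X} \cap B(\mathbf{0}; \ell)) \cup \mathcal{A}^*) : \mathcal{A} \in (\mathbf{R}^d \setminus B(\mathbf{0}; \ell))^k \right\} \right), \text{ and}$$

$$\xi^-(\mathcal{X}; \ell) := \inf_{k \in \mathbf{N}} \left( \operatorname{ess\,inf} \left\{ \xi(\mathbf{0}; (\mathcal{X} \cap B(\mathbf{0}; \ell)) \cup \mathcal{A}^*) : \mathcal{A} \in (\mathbf{R}^d \setminus B(\mathbf{0}; \ell))^k \right\} \right),$$

where for $\mathcal{A} = (x_1, \ldots, x_k) \in (\mathbf{R}^d)^k$ we put $\mathcal{A}^* = \{x_1, \ldots, x_k\}$ (provided all $k$ vectors are
distinct). Define the limit of $\xi$ on $\mathcal{X}$ by

$$\xi_\infty(\mathcal{X}) := \limsup_{\ell \to \infty} \xi^+(\mathcal{X}; \ell).$$

We say the functional $\xi$ stabilizes on $\mathcal{X}$ if

$$\lim_{\ell \to \infty} \xi^+(\mathcal{X}; \ell) = \lim_{\ell \to \infty} \xi^-(\mathcal{X}; \ell) = \xi_\infty(\mathcal{X}).$$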

Stabilization can be interpreted loosely as the property that the value of the functional 
at a point is unaffected by changes in the configuration of points at a sufficiently large 
distance from that point. 

Let $f$ be a probability density function on $\mathbf{R}^d$. For $n \in \mathbf{N}$ let $\mathcal{X}_n := (X_1, X_2, \ldots, X_n)$
be the point process consisting of $n$ independent random $d$-vectors with common density
$f$. With probability one, $\mathcal{X}_n$ has distinct inter-point distances; hence all the
nearest-neighbour type graphs on $\mathcal{X}_n$ that we consider are almost surely unique.

Let $\mathcal{H}_1$ be a homogeneous Poisson point process of unit intensity on $\mathbf{R}^d$. The following
general LLN is due to Penrose & Yukich, and is obtained from Theorem 2.1 of [26] together
with equation (2.9) there (the homogeneous case).

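Theorem 1 Let $q \in \{1, 2\}$. Suppose that $\xi$ is homogeneous of order $\alpha$ and almost surely
stabilizes on $\mathcal{H}_1$, with limit $\xi_\infty(\mathcal{H}_1)$. If $\xi$ satisfies the moments condition

$$\sup_{n \in \mathbf{N}} E[\xi(n^{1/d} X_1; n^{1/d} \mathcal{X}_n)^p] < \infty, \tag{1}$$

for some $p > q$, then as $n \to \infty$,

$$n^{-1} \sum_{x \in \mathcal{X}_n} \xi(n^{1/d} x; n^{1/d} \mathcal{X}_n) \xrightarrow{L^q} E[\xi_\infty(\mathcal{H}_1)] \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx,$$

and the limit is finite.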

From this result we will derive LLNs for the total power-weighted length for a collection
of nearest-neighbour type graphs. Let $j \in \mathbf{N}$. A point $x \in \mathcal{X}$ has a $j$-th nearest neighbour
$y \in \mathcal{X} \setminus \{x\}$ if $\mathrm{card}(\{z : z \in \mathcal{X} \setminus \{x\}, \|z - x\| < \|y - x\|\}) = j - 1$.

For all $x, y \in \mathbf{R}^d$ we define the weight function

$$w_\alpha(x, y) := \|x - y\|^\alpha,$$

for some fixed parameter $\alpha \ge 0$. By the total power-weighted edge length of a graph with
edge set $E$ (where edges may be directed or undirected), we mean the functional

$$\sum_{(u,v) \in E} w_\alpha(u, v) = \sum_{(u,v) \in E} \|u - v\|^\alpha.$$
We will often assume one of the following conditions on the function $f$: either

(C1) $f$ is supported by a convex polyhedron in $\mathbf{R}^d$ and is bounded away from 0 and
infinity on its support; or

(C2) for weight exponent $\alpha \in [0, d)$, we require that $\int_{\mathbf{R}^d} f(x)^{(d-\alpha)/d} \, dx < \infty$ and
$\int_{\mathbf{R}^d} \|x\|^r f(x) \, dx < \infty$ for some $r > d/(d - \alpha)$.

In some cases, we take $f(x) = 1$ for $x \in (0, 1)^d$ and $f(x) = 0$ otherwise, in which
case we denote $\mathcal{X}_n = \mathcal{U}_n = (U_1, U_2, \ldots, U_n)$, the binomial point process consisting of $n$
independent uniform random vectors on $(0, 1)^d$.

In the remainder of this section we present our LLNs derived from Theorem 1.
Theorems 2, 3 and 6 follow directly from Theorem 1 and results in [26], up to evaluation
of constants, while Theorems 4 and 5 need some more work. These results are natural
companions, as are their proofs, which we present in Section 3 below; in particular the
proof of Theorem 2 is useful for the other proofs.

Figure 2: Realizations of the 3-rd NNG' (left) and 5-NNG' (right), each on 50 simulated
uniform random points in the unit square.

2.1 The k-nearest neighbours and j-th nearest neighbour graphs

Let $j \in \mathbf{N}$. In the $j$-th nearest-neighbour (directed) graph on $\mathcal{X}$, denoted by $j$-th NNG'($\mathcal{X}$),
a directed edge joins each point of $\mathcal{X}$ to its $j$-th nearest neighbour.

Let $k \in \mathbf{N}$. In the $k$-nearest neighbours (directed) graph on $\mathcal{X}$, denoted $k$-NNG'($\mathcal{X}$),
a directed edge joins each point of $\mathcal{X}$ to each of its first $k$ nearest neighbours in $\mathcal{X}$
(i.e. each of its $j$-th nearest neighbours for $j = 1, 2, \ldots, k$). Clearly the 1-th NNG' and
1-NNG' coincide, giving the standard nearest-neighbour (directed) graph. See Figure 2
for realizations of particular $j$-th NNG', $k$-NNG'.

We also consider the $k$-nearest neighbours (undirected) graph on $\mathcal{X}$, denoted by
$k$-NNG($\mathcal{X}$), in which an undirected edge joins $x, y \in \mathcal{X}$ if $x$ is one of the first $k$ nearest
neighbours of $y$, or $y$ is one of the first $k$ nearest neighbours of $x$ (or both).

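From now on we take the point set $\mathcal{X}$ to be random; in particular, for $n \in \mathbf{N}$, we take
$\mathcal{X} = \mathcal{X}_n$. For $d \in \mathbf{N}$ and $\alpha \ge 0$, let $\mathcal{L}^{d,\alpha}_j(\mathcal{X}_n)$, $\mathcal{L}^{d,\alpha}_{\le k}(\mathcal{X}_n)$ denote respectively the total
power-weighted edge length of the $j$-th nearest-neighbour (directed) graph, $k$-nearest neighbours
(directed) graph on $\mathcal{X}_n \subset \mathbf{R}^d$. Note that

$$\mathcal{L}^{d,\alpha}_{\le k}(\mathcal{X}_n) = \sum_{j=1}^{k} \mathcal{L}^{d,\alpha}_j(\mathcal{X}_n). \tag{2}$$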
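The definitions above can be illustrated by a brute-force computation. The following Python sketch is not from the paper: the function names are hypothetical, it assumes distinct inter-point distances (true almost surely for the random point sets considered here), and its quadratic cost makes it suitable only for small point sets.

```python
import math

def jth_nn_distance(points, i, j):
    """Distance from points[i] to its j-th nearest neighbour, or None if it has none."""
    dists = sorted(math.dist(points[i], q) for k, q in enumerate(points) if k != i)
    return dists[j - 1] if j <= len(dists) else None

def jth_nng_weight(points, j, alpha):
    """Sum of ||u - v||^alpha over the directed edges (u, v) of the j-th NNG'."""
    total = 0.0
    for i in range(len(points)):
        dist = jth_nn_distance(points, i, j)
        if dist is not None:  # a point with fewer than j other points has no edge
            total += dist ** alpha
    return total

def knng_weight(points, k, alpha):
    """Total weight of the k-NNG', computed via the decomposition (2)."""
    return sum(jth_nng_weight(points, j, alpha) for j in range(1, k + 1))
```

For three collinear points at 0, 1 and 3 on the first axis, the nearest-neighbour distances are 1, 1 and 2, so `jth_nng_weight(pts, 1, 1.0)` sums to 4.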



For $d \in \mathbf{N}$, we denote the volume of the unit $d$-ball (see e.g. (6.50) in [13]) by

$$v_d := \pi^{d/2} [\Gamma(1 + (d/2))]^{-1}. \tag{3}$$

Theorems 2 and 4 below feature constants $C(d, \alpha, k)$ defined for $d, k \in \mathbf{N}$, $\alpha \ge 0$ by

$$C(d, \alpha, k) := v_d^{-\alpha/d} \, \frac{d}{d + \alpha} \, \frac{\Gamma(k + 1 + (\alpha/d))}{\Gamma(k)}. \tag{4}$$

Our first result is Theorem 2 below, which gives LLNs for $\mathcal{L}^{d,\alpha}_j(\mathcal{X}_n)$ and $\mathcal{L}^{d,\alpha}_{\le k}(\mathcal{X}_n)$,
with explicit expressions for the limiting constants; it is the natural starting point for
our LLNs for nearest-neighbour type graphs. Let $\mathrm{supp}(f)$ denote the support of $f$; under
(C1), $\mathrm{supp}(f)$ is a convex polyhedron, under (C2) $\mathrm{supp}(f)$ is $\mathbf{R}^d$.
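The constants in (3) and (4) are easy to evaluate numerically. The sketch below uses only the Python standard library; the function names `unit_ball_volume` and `C` are ours, chosen for illustration.

```python
import math

def unit_ball_volume(d):
    """v_d from (3): volume of the unit Euclidean ball in dimension d."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

def C(d, alpha, k):
    """The constant C(d, alpha, k) from (4)."""
    v = unit_ball_volume(d)
    return (v ** (-alpha / d)) * d / (d + alpha) * math.gamma(k + 1 + alpha / d) / math.gamma(k)
```

For example, `unit_ball_volume(2)` is $\pi$, and in the plane with $\alpha = 1$ the formula gives $C(2, 1, 1) = 1/2$ and $C(2, 1, 2) = 5/4$.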

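Theorem 2 Let $d \in \mathbf{N}$. The following results hold, with $p = 2$, for $\alpha \ge 0$ if $f$ satisfies
condition (C1), and, with $p = 1$, for $\alpha \in [0, d)$ if $f$ satisfies condition (C2).

(a) For the $j$-th NNG' on $\mathbf{R}^d$ we have, as $n \to \infty$,

$$n^{(\alpha-d)/d} \mathcal{L}^{d,\alpha}_j(\mathcal{X}_n) \xrightarrow{L^p} v_d^{-\alpha/d} \, \frac{\Gamma(j + (\alpha/d))}{\Gamma(j)} \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx. \tag{5}$$

(b) For the $k$-NNG' on $\mathbf{R}^d$ we have, as $n \to \infty$,

$$n^{(\alpha-d)/d} \mathcal{L}^{d,\alpha}_{\le k}(\mathcal{X}_n) \xrightarrow{L^p} C(d, \alpha, k) \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx. \tag{6}$$

In particular, as $n \to \infty$,

$$n^{(\alpha-d)/d} \mathcal{L}^{d,\alpha}_{\le k}(\mathcal{U}_n) \xrightarrow{L^p} C(d, \alpha, k). \tag{7}$$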
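A small simulation gives a feel for (7). The sketch below is illustrative only: the sample size, the seed and the function name are our choices, and the quadratic-time nearest-neighbour search is not meant for large $n$. It treats the case $d = 2$, $\alpha = 1$, $k = 1$, where the limit in (7) is $C(2, 1, 1) = 1/2$; for moderate $n$ the estimate is expected to sit slightly above the limit because of boundary effects.

```python
import math
import random

def scaled_nn_length(n, alpha=1.0, seed=0):
    """n^{(alpha-2)/2} times the total power-weighted length of the 1-NNG'
    on n independent uniform points in the unit square."""
    rng = random.Random(seed)
    pts = [(rng.random(), rng.random()) for _ in range(n)]
    total = 0.0
    for i, p in enumerate(pts):
        nearest = min(math.dist(p, q) for k, q in enumerate(pts) if k != i)
        total += nearest ** alpha
    return n ** ((alpha - 2) / 2) * total

limit = math.pi ** -0.5 * math.gamma(1.5)  # C(2, 1, 1)
estimate = scaled_nn_length(600)
```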

Remarks. (a) If we use a different norm on $\mathbf{R}^d$ from the Euclidean, Theorem 2 remains
valid with $v_d$ redefined as the volume of the unit $d$-ball in the chosen norm.

(b) Theorem 2 is essentially contained in Theorem 2.4 of [26], with the constants
evaluated explicitly. There are several related LLN results in the literature. Theorem 8.3 of
[31] gives LLNs (with complete convergence) for $\mathcal{L}^{d,1}_{\le k}(\mathcal{X}_n)$ (see also [?]); the limiting
constants are not given. Avram & Bertsimas (Theorem 7 of [2]) state a result on the limiting
expectation (and hence the constant in the LLN) for $\mathcal{L}^{2,1}_j(\mathcal{U}_n)$, which they attribute to
Miles [18] (see also p. 101 of [31]). The constant in [2] is given as

$$\frac{1}{2} \pi^{-1/2} \sum_{i=1}^{j} \frac{\Gamma(i - (1/2))}{\Gamma(i)},$$

which simplifies (by induction on $j$) to $\pi^{-1/2} \Gamma(j + (1/2))/\Gamma(j)$, the $d = 2$, $\alpha = 1$ case of
(5) in the case $\mathcal{X}_n = \mathcal{U}_n$.

(c) Related results are the asymptotic expectations of $j$-th nearest neighbour distances
in finite point sets given in [9] and [19]. The results in [19] are consistent with the $\alpha = 1$
case of our (7). The result in [9] includes general $\alpha$ and certain non-uniform densities,
although their conditions on $f$ are more restrictive than our (C1); the result is consistent
with (6). Also, [?] gives (equation (6.4)) a weak LLN for the empirical mean $k$-nearest
neighbour distance. With Theorem 2.4 of [26], the results in [9] yield LLNs for the total
weight of the $j$-th NNG' and $k$-NNG' only when $d - 1 < \alpha < d$ (due to the rates of
convergence given in [9]).

(d) Smith [28] gives, in some sense, expectations of randomly selected edge lengths for
nearest-neighbour type graphs on the homogeneous Poisson point process of unit intensity
in $\mathbf{R}^d$, including the $j$-th NNG', nearest-neighbour (undirected) graph, and Gabriel graph.
His results coincide with ours only for the $j$-th NNG', since here each vertex contributes
a fixed number ($j$) of directed edges: equation (5.4.1) of [28] matches the expression for
our $C(d, 1, k)$.



From the results on nearest-neighbour (directed) graphs, we may obtain results for 
nearest-neighbour (undirected) graphs, in which if x is a nearest neighbour of y and vice 
versa, then the edge between x and y is counted only once. As an example, we give the 
following result. 

For $d \in \mathbf{N}$ and $\alpha \ge 0$ let $\mathcal{N}^{d,\alpha}(\mathcal{X}_n)$ denote the total power-weighted edge length of the
nearest-neighbour (undirected) graph on $\mathcal{X}_n \subset \mathbf{R}^d$. For $d \in \mathbf{N}$, let $\omega_d$ be the volume of
the union of two unit $d$-balls with centres unit distance apart in $\mathbf{R}^d$.

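Theorem 3 Suppose that $d \in \mathbf{N}$, $\alpha \ge 0$ and $f$ satisfies condition (C1). As $n \to \infty$,

$$n^{(\alpha-d)/d} \mathcal{N}^{d,\alpha}(\mathcal{X}_n) \xrightarrow{L^2} \Gamma(1 + (\alpha/d)) \left( v_d^{-\alpha/d} - \frac{1}{2} v_d \, \omega_d^{-1-(\alpha/d)} \right) \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx. \tag{8}$$

In particular, when $d = 2$ we have, for $\alpha \ge 0$,

$$n^{(\alpha-2)/2} \mathcal{N}^{2,\alpha}(\mathcal{X}_n) \xrightarrow{L^2} \Gamma(1 + (\alpha/2)) \left( \pi^{-\alpha/2} - \frac{\pi}{2} \left( \frac{6}{8\pi + 3\sqrt{3}} \right)^{1+(\alpha/2)} \right) \int_{\mathrm{supp}(f)} f(x)^{(2-\alpha)/2} \, dx, \tag{9}$$

and when $d = 2$, $\alpha = 1$, we get

$$n^{-1/2} \mathcal{N}^{2,1}(\mathcal{X}_n) \xrightarrow{L^2} \left( \frac{1}{2} - \frac{1}{4} \left( \frac{6\pi}{8\pi + 3\sqrt{3}} \right)^{3/2} \right) \int_{\mathrm{supp}(f)} f(x)^{1/2} \, dx \approx (0.377508) \int_{\mathrm{supp}(f)} f(x)^{1/2} \, dx. \tag{10}$$

Finally, when $d = 1$, $\alpha = 1$, we have $\mathcal{N}^{1,1}(\mathcal{U}_n) \xrightarrow{L^2} 7/18$ as $n \to \infty$.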
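The numerical values quoted in Theorem 3 can be checked directly from (8). The sketch below is illustrative; the function names are ours, and the values of $\omega_1$ and $\omega_2$ are hard-coded from the proof in Section 3.1 ($\omega_1 = 3$, $\omega_2 = (4\pi/3) + (\sqrt{3}/2)$) rather than computed for general $d$.

```python
import math

def unit_ball_volume(d):
    """v_d from (3)."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

# Volume of the union of two unit d-balls with centres unit distance apart,
# for the two dimensions whose values are stated in the text.
OMEGA = {1: 3.0, 2: 4 * math.pi / 3 + math.sqrt(3) / 2}

def nn_undirected_constant(d, alpha):
    """Constant multiplying the integral in (8); only d = 1, 2 are supported here."""
    v, w = unit_ball_volume(d), OMEGA[d]
    return math.gamma(1 + alpha / d) * (v ** (-alpha / d) - 0.5 * v * w ** (-1 - alpha / d))
```

With these definitions, `nn_undirected_constant(2, 1)` agrees with the value 0.377508 in (10) and `nn_undirected_constant(1, 1)` with 7/18.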

Remark. A pair of points, each of which is the other's nearest neighbour, is known as
a reciprocal pair. Reciprocal pairs are of interest in ecology (see [27]). When $\alpha = 0$,
$\mathcal{N}^{d,0}(\mathcal{X}_n)$ counts the number of vertices, minus one half of the number of reciprocal pairs.
In this case (8) says $n^{-1} \mathcal{N}^{d,0}(\mathcal{X}_n) \xrightarrow{L^2} 1 - (v_d/(2\omega_d))$. This is consistent with results
of Henze [12] for the fraction of points that are the $\ell$-th nearest neighbour of their own
$k$-th nearest neighbour; in particular, (see [12] and references therein) as $n \to \infty$, the
probability that a point is in a reciprocal pair tends to $v_d/\omega_d$.



2.2 The on-line nearest-neighbour graph 

We now consider the on-line nearest-neighbour graph (ONG). Let $d \in \mathbf{N}$. Suppose
$x_1, x_2, \ldots$ are points in $(0, 1)^d$, arriving sequentially; for $n \in \mathbf{N}$ form a graph on vertex set
$\{x_1, \ldots, x_n\}$ by connecting each point $x_i$, $i = 2, 3, \ldots, n$ to its nearest neighbour amongst
its predecessors (i.e. $x_1, \ldots, x_{i-1}$), using the lexicographic ordering on $\mathbf{R}^d$ to break any
ties. The resulting tree is the ONG on $(x_1, x_2, \ldots, x_n)$.

Again, we take our sequence of points to be random. We restrict our analysis to the
case in which we have independent uniformly distributed points $U_1, U_2, \ldots$ on $(0, 1)^d$. For
$d \in \mathbf{N}$, $\alpha \ge 0$ and $n \in \mathbf{N}$, let $\mathcal{O}^{d,\alpha}(\mathcal{U}_n)$ denote the total power-weighted edge length of
the ONG on sequence $\mathcal{U}_n = (U_1, \ldots, U_n)$. The next result gives a new LLN for $\mathcal{O}^{d,\alpha}(\mathcal{U}_n)$
when $\alpha < d$.

Theorem 4 Suppose $d \in \mathbf{N}$ and $\alpha \in [0, d)$. With $C(d, \alpha, k)$ as given by (4) we have that
as $n \to \infty$

$$n^{(\alpha-d)/d} \mathcal{O}^{d,\alpha}(\mathcal{U}_n) \xrightarrow{L^1} \frac{d}{d - \alpha} C(d, \alpha, 1) = \frac{d}{d - \alpha} v_d^{-\alpha/d} \Gamma(1 + (\alpha/d)). \tag{11}$$

Related results include those on convergence in distribution of $\mathcal{O}^{d,\alpha}(\mathcal{U}_n)$, given in [?]
for $\alpha > d$ ($\alpha > 1/2$ in the case $d = 1$) and in [21] in the form of a central limit theorem
for $\alpha \in (0, 1/4)$. Also, the ONG in $d = 1$ is related to the 'directed linear tree' considered
in [23].
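The ONG construction and the limit in (11) can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the paper's procedure: the function names are ours, and ties are broken by taking the earliest predecessor rather than by the lexicographic rule (ties occur with probability zero for the random points considered here).

```python
import math

def ong_edges(points):
    """Join each point after the first to its nearest predecessor.
    Returns a list of (child_index, parent_index) pairs."""
    edges = []
    for i in range(1, len(points)):
        parent = min(range(i), key=lambda k: math.dist(points[i], points[k]))
        edges.append((i, parent))
    return edges

def ong_weight(points, alpha):
    """Total power-weighted edge length of the ONG on the given sequence."""
    return sum(math.dist(points[i], points[k]) ** alpha for i, k in ong_edges(points))

def ong_limit(d, alpha):
    """Right-hand side of (11); requires 0 <= alpha < d."""
    v = math.pi ** (d / 2) / math.gamma(1 + d / 2)
    return d / (d - alpha) * v ** (-alpha / d) * math.gamma(1 + alpha / d)
```

In the plane with $\alpha = 1$, `ong_limit(2, 1.0)` evaluates to 1, twice the corresponding constant $C(2, 1, 1) = 1/2$ for the ordinary nearest-neighbour graph.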



2.3 The minimal directed spanning forest 

The minimal directed spanning forest (MDSF) is related to the standard nearest-neighbour 
(directed) graph, with the additional constraint that edges can only lie in a given direction. 
In general, the MDSF can be defined as a global optimization problem for directed graphs 
on partially ordered sets endowed with a weight function, and it also admits a local 
construction; see [5], [22], [23]. As above, we consider the Euclidean setting, where our
points lie in $\mathbf{R}^d$.

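Suppose that $\mathcal{X} \subset \mathbf{R}^d$ is a finite set bearing a partial order $\preccurlyeq$. A minimal element, or
sink, of $\mathcal{X}$ is a vertex $v_0 \in \mathcal{X}$ for which there exists no $v \in \mathcal{X} \setminus \{v_0\}$ such that $v \preccurlyeq v_0$.
Let $\mathcal{S}$ denote the set of all sinks of $\mathcal{X}$. (Note that $\mathcal{S}$ cannot be empty.)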

For $v \in \mathcal{X}$, we say that $u \in \mathcal{X} \setminus \{v\}$ is a directed nearest neighbour of $v$ if $u \preccurlyeq v$ and
$\|v - u\| \le \|v - u'\|$ for all $u' \in \mathcal{X} \setminus \{v\}$ such that $u' \preccurlyeq v$. For each $v \in \mathcal{X} \setminus \mathcal{S}$, let $n_v$ be
a directed nearest neighbour of $v$ (chosen arbitrarily if $v$ has more than one). Then (see
[22]) the directed graph on $\mathcal{X}$ obtained by taking edge set $E := \{(v, n_v) : v \in \mathcal{X} \setminus \mathcal{S}\}$ is a
MDSF of $\mathcal{X}$. Thus, if all edge-weights are distinct, the MDSF is unique, and is obtained
by connecting each non-minimal vertex to its directed nearest neighbour. In the case
where there is a single sink, the MDSF is a tree (ignoring directedness of edges) and it is
called the minimal directed spanning tree (MDST).

For what follows, we consider a general type of partial order on $\mathbf{R}^2$, denoted $\preccurlyeq^{\theta,\phi}$,
specified by the angles $\theta \in [0, 2\pi)$ and $\phi \in (0, \pi]$. For $x \in \mathbf{R}^2$, let $C_{\theta,\phi}(x)$ be the closed
half-cone of angle $\phi$ with vertex $x$ and boundaries given by the rays from $x$ at angles $\theta$
and $\theta + \phi$, measuring anticlockwise from the upwards vertical. The partial order is such
that, for $x_1, x_2 \in \mathbf{R}^2$,

$$x_1 \preccurlyeq^{\theta,\phi} x_2 \text{ iff } x_1 \in C_{\theta,\phi}(x_2). \tag{12}$$

We shall use $\preccurlyeq^*$ as shorthand for the special case $\preccurlyeq^{\pi/2,\pi/2}$, which is of particular interest, as
in [5]. In this case $(u_1, u_2) \preccurlyeq^* (v_1, v_2)$ iff $u_1 \le v_1$ and $u_2 \le v_2$. The symbol $\preccurlyeq$ will denote
a general partial order on $\mathbf{R}^2$. Note that in the case $\phi = \pi$, (12) does not, in fact, define
a partial order on the whole of $\mathbf{R}^2$, since the antisymmetric property ($x \preccurlyeq y$ and $y \preccurlyeq x$
implies $x = y$) fails; however it is, with probability one, a true partial order (in fact, a
total order) on the random point sets that we consider.
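The local construction of the MDSF described above translates directly into code. The following Python sketch is illustrative only: the function names are ours, the default order is the coordinatewise order $\preccurlyeq^*$ (passing a different `precedes` predicate would give another partial order), and distinct edge weights are assumed so that the directed nearest neighbour is unique.

```python
import math

def precedes_star(u, v):
    """Coordinatewise order: u precedes v iff u_1 <= v_1 and u_2 <= v_2."""
    return u[0] <= v[0] and u[1] <= v[1]

def mdsf_edges(points, precedes=precedes_star):
    """Map each non-minimal point index to the index of its directed nearest neighbour."""
    edges = {}
    for i, v in enumerate(points):
        candidates = [k for k, u in enumerate(points) if k != i and precedes(u, v)]
        if candidates:  # points with no predecessor are sinks and have no outgoing edge
            edges[i] = min(candidates, key=lambda k: math.dist(v, points[k]))
    return edges

def mdsf_weight(points, alpha, precedes=precedes_star):
    """Total power-weighted edge length of the MDSF."""
    return sum(math.dist(points[i], points[k]) ** alpha
               for i, k in mdsf_edges(points, precedes).items())
```

If the origin is included in a point set drawn from the unit square, it is the unique sink under $\preccurlyeq^*$, so the returned edges form a tree rooted there.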

We do not permit here the case $\phi = 0$, which would almost surely give us a disconnected
point set. Nor do we allow $\phi \in (\pi, 2\pi]$, since in this case the directional relation (12) is
not a partial order, since the transitivity property (if $u \preccurlyeq v$ and $v \preccurlyeq w$ then $u \preccurlyeq w$) fails
for $\phi \in (\pi, 2\pi]$.

Again we take $\mathcal{X}$ to be random; set $\mathcal{X} = \mathcal{X}_n$, where (as before) $\mathcal{X}_n$ is a point process
consisting of $n$ independent random points on $(0, 1)^2$ with common density $f$. When the
partial order is $\preccurlyeq^*$, as in [5], we also consider the point set $\mathcal{X}_n^0 := \mathcal{X}_n \cup \{\mathbf{0}\}$ (where $\mathbf{0}$ is
the origin in $\mathbf{R}^2$) on which the MDSF is a MDST rooted at $\mathbf{0}$.

In this random setting, almost surely each point of $\mathcal{X}$ has a unique directed nearest
neighbour, so that $\mathcal{X}$ has a unique MDSF. Denote by $\mathcal{M}^\alpha(\mathcal{X})$ the total power-weighted
edge length, with weight exponent $\alpha > 0$, of the MDSF on $\mathcal{X}$.

Theorem 5 presents LLNs for $\mathcal{M}^\alpha(\mathcal{X}_n)$ in the uniform case $\mathcal{X}_n = \mathcal{U}_n$. However, the
proof carries through to other distributions. In particular, if the points of $\mathcal{X}_n$ are
distributed in $\mathbf{R}^2$ with a density $f$ that satisfies condition (C1) above, then (13) holds with
a factor of $\int_{\mathrm{supp}(f)} f(x)^{(2-\alpha)/2} \, dx$ introduced into the right-hand side.

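Theorem 5 Let $d \in \mathbf{N}$ and $\alpha \in (0, 2)$. Under partial order $\preccurlyeq^{\theta,\phi}$ with $\theta \in [0, 2\pi)$ and
$\phi \in (0, \pi]$, we have that, as $n \to \infty$,

$$n^{(\alpha-2)/2} \mathcal{M}^\alpha(\mathcal{U}_n) \xrightarrow{L^1} (2/\phi)^{\alpha/2} \, \Gamma(1 + (\alpha/2)). \tag{13}$$

Moreover, when the partial order is $\preccurlyeq^*$, (13) remains true with $\mathcal{U}_n$ replaced by $\mathcal{U}_n^0$.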



2.4 The Gabriel graph 

In the Gabriel graph (see [11]) on point set $\mathcal{X} \subset \mathbf{R}^d$, two points are joined by an edge
iff the ball that has the line segment joining those two points as a diameter contains no
other points of $\mathcal{X}$. The Gabriel graph has been applied in many of the same contexts as
nearest-neighbour graphs; see for example [30].

For $d \in \mathbf{N}$ and $\alpha > 0$, let $\mathcal{G}^{d,\alpha}(\mathcal{X})$ denote the total power-weighted edge length of
the Gabriel graph on $\mathcal{X} \subset \mathbf{R}^d$. As before, we consider the random point set $\mathcal{X}_n$ with
underlying density $f$. A LLN for $\mathcal{G}^{d,\alpha}(\mathcal{X}_n)$ was given in [26]; in the present paper we give
the limiting constant explicitly.

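Theorem 6 Let $d \in \mathbf{N}$ and $\alpha > 0$. Suppose that $f$ satisfies (C1). As $n \to \infty$,

$$n^{(\alpha-d)/d} \mathcal{G}^{d,\alpha}(\mathcal{X}_n) \xrightarrow{L^2} v_d^{-\alpha/d} \, 2^{d+\alpha-1} \, \Gamma(1 + (\alpha/d)) \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx. \tag{14}$$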
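The Gabriel graph rule and the constant in (14) can be sketched as follows. This is a brute-force illustration with hypothetical function names, cubic in the number of points; it uses the elementary fact that a point $r$ lies in the closed ball with diameter $pq$ exactly when $(r - p) \cdot (r - q) \le 0$.

```python
import math
from itertools import combinations

def gabriel_edges(points):
    """Undirected Gabriel graph edges, as index pairs (i, j) with i < j."""
    edges = []
    for i, j in combinations(range(len(points)), 2):
        p, q = points[i], points[j]
        blocked = any(
            sum((rc - pc) * (rc - qc) for rc, pc, qc in zip(r, p, q)) <= 0
            for k, r in enumerate(points) if k not in (i, j)
        )
        if not blocked:
            edges.append((i, j))
    return edges

def gabriel_weight(points, alpha):
    """Total power-weighted edge length of the Gabriel graph."""
    return sum(math.dist(points[i], points[j]) ** alpha for i, j in gabriel_edges(points))

def gabriel_limit(d, alpha):
    """Constant multiplying the integral in (14)."""
    v = math.pi ** (d / 2) / math.gamma(1 + d / 2)
    return v ** (-alpha / d) * 2 ** (d + alpha - 1) * math.gamma(1 + alpha / d)
```

For $d = 2$ and $\alpha = 1$ the constant in (14) evaluates to 2.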



3 Proofs 



3.1 Proof of Theorems 2 and 3

For $j \in \mathbf{N}$, let $d_j(x; \mathcal{X})$ be the (Euclidean) distance from $x$ to its $j$-th nearest neighbour
in $\mathcal{X} \setminus \{x\}$, if such a neighbour exists, or zero otherwise. We will use the following form
of Euler's Gamma integral (see equation 6.1.1 in [1]). For $a \ge 0$ and $b, c > 0$,

$$\int_0^\infty r^a e^{-c r^b} \, dr = \frac{1}{b} \, c^{-(a+1)/b} \, \Gamma((a + 1)/b). \tag{15}$$
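As a sanity check on (15), the closed form can be compared with a direct numerical integration. The sketch below uses a plain midpoint rule with a truncated upper limit; the truncation point, step count and function names are our choices, and the comparison is only meaningful for parameters where the integrand is negligible beyond the truncation point.

```python
import math

def gamma_integral_closed_form(a, b, c):
    """Right-hand side of (15)."""
    return c ** (-(a + 1) / b) * math.gamma((a + 1) / b) / b

def gamma_integral_numeric(a, b, c, upper=20.0, steps=200_000):
    """Midpoint-rule approximation of the integral of r^a exp(-c r^b) over (0, upper)."""
    h = upper / steps
    return h * sum(((k + 0.5) * h) ** a * math.exp(-c * ((k + 0.5) * h) ** b)
                   for k in range(steps))
```

For instance, with $a = b = 2$ and $c = 1$ both sides equal $\Gamma(3/2)/2 = \pi^{1/2}/4$.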
Proof of Theorem 2. In applying Theorem 1 to the $j$-th NNG' and $k$-NNG' functionals,
we take $\xi(x; \mathcal{X}_n)$ to be $(d_j(x; \mathcal{X}_n))^\alpha$, where $\alpha \ge 0$. Then $\xi$ is translation invariant and
homogeneous of order $\alpha$. It was shown in Theorem 2.4 of [26] that the $j$-th NNG' total
weight functional $\xi$ satisfies the conditions of Theorem 1 in the following two cases: (i)
with $q = 2$, if $f$ satisfies (C1), and $\alpha \ge 0$; and (ii) with $q = 1$, if $f$ satisfies (C2), and
$0 \le \alpha < d$. (In fact, in [26] this is proved for the $k$-NNG' functional $\sum_{j=1}^{k} (d_j(x; \mathcal{X}_n))^\alpha$,
but this implies that the conditions also hold for the $j$-th NNG' functional $(d_j(x; \mathcal{X}_n))^\alpha$.)

The functional $\xi(x; \mathcal{X}_n) = (d_j(x; \mathcal{X}_n))^\alpha$ stabilizes on $\mathcal{H}_1$, with limit $\xi_\infty(\mathcal{H}_1) = (d_j(\mathbf{0}; \mathcal{H}_1))^\alpha$.
Also, the moment condition (1) is satisfied for some $p > 1$ (if $f$ satisfies (C2) and $\alpha < d$)
or $p > 2$ (if $f$ satisfies (C1)), and so Theorem 1, with $q = 1$ or $q = 2$ respectively, yields
(using the fact that $\xi$ is homogeneous of order $\alpha$)

$$n^{(\alpha-d)/d} \mathcal{L}^{d,\alpha}_j(\mathcal{X}_n) = n^{-1} \sum_{x \in \mathcal{X}_n} n^{\alpha/d} \xi(x; \mathcal{X}_n) = n^{-1} \sum_{x \in \mathcal{X}_n} \xi(n^{1/d} x; n^{1/d} \mathcal{X}_n) \xrightarrow{L^q} E[\xi_\infty(\mathcal{H}_1)] \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx. \tag{16}$$

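We now need to evaluate the expectation on the right-hand side of (16). For $r > 0$,

$$P[\xi_\infty(\mathcal{H}_1) > r] = P[d_j(\mathbf{0}; \mathcal{H}_1) > r^{1/\alpha}] = \sum_{i=0}^{j-1} P[\mathrm{card}(B(\mathbf{0}; r^{1/\alpha}) \cap \mathcal{H}_1) = i] = \sum_{i=0}^{j-1} \frac{(v_d r^{d/\alpha})^i}{i!} \exp(-v_d r^{d/\alpha}),$$

where $v_d$ is given by (3). So

$$E[\xi_\infty(\mathcal{H}_1)] = \int_0^\infty P[\xi_\infty(\mathcal{H}_1) > r] \, dr = \int_0^\infty \sum_{i=0}^{j-1} \frac{(v_d r^{d/\alpha})^i}{i!} \exp(-v_d r^{d/\alpha}) \, dr.$$

Interchanging the order of summation and integration, and using (15), we obtain

$$E[\xi_\infty(\mathcal{H}_1)] = v_d^{-\alpha/d} \, \frac{\alpha}{d} \sum_{i=0}^{j-1} \frac{\Gamma(i + (\alpha/d))}{\Gamma(i + 1)} = v_d^{-\alpha/d} \, \frac{\Gamma(j + (\alpha/d))}{\Gamma(j)}, \tag{17}$$

where the final equality follows by induction on $j$. Then from (3), (16) and (17) we obtain
the $j$-th NNG' result (5). By (2), the $k$-NNG' result (6) follows from (5) with

$$C(d, \alpha, k) = v_d^{-\alpha/d} \sum_{j=1}^{k} \frac{\Gamma(j + (\alpha/d))}{\Gamma(j)} = v_d^{-\alpha/d} \, \frac{d}{d + \alpha} \, \frac{\Gamma(k + 1 + (\alpha/d))}{\Gamma(k)}.$$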
Proof of Theorem 3. The nearest-neighbour (directed) graph counts the weights of
edges from points that are nearest neighbours of their own nearest neighbours twice,
while the nearest-neighbour (undirected) graph counts such weights only once.

Let $q(x; \mathcal{X})$ be the distance from $x$ to its nearest neighbour in $\mathcal{X} \setminus \{x\}$ if $x$ is a nearest
neighbour of its own nearest neighbour, and zero otherwise. Recall that $d_1(x; \mathcal{X})$ is the
distance from $x$ to its nearest neighbour in $\mathcal{X} \setminus \{x\}$. For $\alpha \ge 0$, define

$$\xi'(x; \mathcal{X}) := (d_1(x; \mathcal{X}))^\alpha - \frac{1}{2} (q(x; \mathcal{X}))^\alpha.$$

Then $\sum_{x \in \mathcal{X}} \xi'(x; \mathcal{X})$ is the total weight of the nearest-neighbour (undirected) graph on
$\mathcal{X}$. Note that $\xi'$ is translation invariant and homogeneous of order $\alpha$.

One can check that $\xi'$ is stabilizing on the Poisson process $\mathcal{H}_1$, using similar arguments
to those for the $j$-th NNG' and $k$-NNG' functionals. Also (see [26]) if condition (C1) holds
then $\xi'$ satisfies the moments condition (1) for some $p > 2$, for all $\alpha \ge 0$.

Let $e_1$ be a vector of unit length in $\mathbf{R}^d$. For $d \in \mathbf{N}$, let $\omega_d := |B(\mathbf{0}; 1) \cup B(e_1; 1)|$, the
volume of the union of two unit $d$-balls with centres unit distance apart.

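Now we apply Theorem 1 with $q = 2$. We have

$$n^{(\alpha/d)-1} \mathcal{N}^{d,\alpha}(\mathcal{X}_n) = n^{-1} \sum_{x \in \mathcal{X}_n} \xi'(n^{1/d} x; n^{1/d} \mathcal{X}_n) \xrightarrow{L^2} E[\xi'_\infty(\mathcal{H}_1)] \int_{\mathrm{supp}(f)} f(x)^{(d-\alpha)/d} \, dx, \tag{18}$$

where $E[\xi'_\infty(\mathcal{H}_1)] = E[(d_1(\mathbf{0}; \mathcal{H}_1))^\alpha] - (1/2) E[(q(\mathbf{0}; \mathcal{H}_1))^\alpha]$. Now we need to evaluate
$E[(q(\mathbf{0}; \mathcal{H}_1))^\alpha]$. With $X$ denoting the nearest point of $\mathcal{H}_1$ to $\mathbf{0}$,

$$P[q(\mathbf{0}; \mathcal{H}_1) \in dr] = P[\{\|X\| \in dr\} \cap \{\mathcal{H}_1 \cap (B(\mathbf{0}; r) \cup B(X; r)) = \{X\}\}] = d v_d r^{d-1} e^{-v_d r^d} e^{-(\omega_d - v_d) r^d} \, dr = d v_d r^{d-1} e^{-\omega_d r^d} \, dr.$$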

So using (15) we obtain

$$E[(q(\mathbf{0}; \mathcal{H}_1))^\alpha] = \int_0^\infty d v_d r^{d-1+\alpha} e^{-\omega_d r^d} \, dr = v_d \, \omega_d^{-1-(\alpha/d)} \, \Gamma(1 + (\alpha/d)). \tag{19}$$

Then from (18) with (19) and the $j = 1$ case of (17) we obtain (8). By some calculus,
$\omega_2 = (4\pi/3) + (\sqrt{3}/2)$, which with the $d = 2$ case of (8) yields (9); for (10) note that
$\Gamma(3/2) = \pi^{1/2}/2$ (see 6.1.9 in [1]). Finally, we obtain the statement for $\mathcal{N}^{1,1}(\mathcal{U}_n)$ from the
$d = 1$ case of (8) since $\omega_1 = 3$. □

3.2 Proof of Theorem 4

In order to obtain our LLN (Theorem 4 above), we modify the setup of the ONG slightly.
Let $\mathcal{U}_n$ be a marked random finite point process in $\mathbf{R}^d$, consisting of $n$ independent uniform
random vectors in $(0, 1)^d$, where each point $U_i$ of $\mathcal{U}_n$ carries a random mark $T(U_i)$ which
is uniformly distributed on $[0, 1]$, independent of the other marks and of the point process
$\mathcal{U}_n$. Join each point $U_i$ of $\mathcal{U}_n$ to its nearest neighbour amongst those points of $\mathcal{U}_n$ with
mark less than $T(U_i)$, if there are any such points, to obtain a graph that we call the
ONG on the marked point set $\mathcal{U}_n$. This definition extends to infinite but locally finite
point sets.

Clearly the ONG on the marked point process $\mathcal{U}_n$ has the same distribution as the
ONG (with the first definition) on a sequence $U_1, U_2, \ldots, U_n$ of independent uniform
points on $(0, 1)^d$.

We apply Theorem 1 to obtain a LLN for $\mathcal{O}^{d,\alpha}(\mathcal{U}_n)$, $\alpha \in [0, d)$. Once again, the method
enables us to evaluate the limit explicitly. We take $f$ to be the indicator of $(0, 1)^d$. Define
$D(x; \mathcal{X})$ to be the distance from point $x$ with mark $T(x)$ to its nearest neighbour in
$\mathcal{X}$ amongst those points $y \in \mathcal{X}$ that have mark $T(y)$ such that $T(y) < T(x)$, if such
a neighbour exists, or zero otherwise. We take $\xi(x; \mathcal{X})$ to be $(D(x; \mathcal{X}))^\alpha$. Again, $\xi$ is
translation invariant and homogeneous of order $\alpha$.

Lemma 1 The ONG functional $\xi$ almost surely stabilizes on $\mathcal{H}_1$.

Proof. Although the notion of stabilization there is somewhat different, the same
argument as given at the start of the proof of Theorem 3.6 of [21] applies. □



Lemma 2 Let $d \in \mathbf{N}$, $\alpha \in [0, d)$, and let $p > 1$ with $\alpha p < d$. Then the ONG functional
$\xi$ satisfies the moments condition (1).

Proof. Let $T_n$ denote the rank of the mark of $U_1$ amongst the marks of all the points
of $\mathcal{U}_n$, so that $T_n$ is distributed uniformly over the integers $1, 2, \ldots, n$. We have, by
conditioning on $T_n$,

$$E[(\xi(n^{1/d} U_1; n^{1/d} \mathcal{U}_n))^p] = n^{-1} \sum_{i=1}^{n} E[(d_1(n^{1/d} U_1; n^{1/d} \mathcal{U}_i))^{p\alpha}] = n^{-1} \sum_{i=1}^{n} (n/i)^{p\alpha/d} E[(d_1(i^{1/d} U_1; i^{1/d} \mathcal{U}_i))^{p\alpha}]. \tag{20}$$

It was shown in [23] that there exists $C \in (0, \infty)$ such that for all $r > 0$

$$\sup_{i \ge 1} P[d_1(i^{1/d} U_1; i^{1/d} \mathcal{U}_i) > r] \le C \exp(-r^d/C).$$

Thus the last expectation in (20) is bounded by a constant independent of $i$. So the final
expression in (20) is bounded by a constant times

$$n^{(p\alpha-d)/d} \sum_{i=1}^{n} i^{-p\alpha/d},$$

which is uniformly bounded by a constant for $\alpha p < d$. □
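The final boundedness claim follows from the elementary comparison $\sum_{i=1}^{n} i^{-s} \le \int_0^n x^{-s} \, dx = n^{1-s}/(1-s)$ for $s = p\alpha/d \in [0, 1)$, so the displayed quantity is at most $1/(1-s)$ for every $n$. The short Python sketch below, with a hypothetical function name, evaluates the quantity for a few values of $n$ and $s$.

```python
def lemma2_factor(n, s):
    """n^(s-1) * sum_{i=1}^{n} i^(-s), where s stands for p*alpha/d in [0, 1)."""
    return n ** (s - 1) * sum(i ** (-s) for i in range(1, n + 1))

# The values stay below 1/(1 - s) as n grows.
values = {(n, s): lemma2_factor(n, s) for n in (10, 1000, 100000) for s in (0.25, 0.5, 0.9)}
```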

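Proof of Theorem 4. Let $d \in \mathbf{N}$. Let $f$ be the indicator of $(0, 1)^d$, and $\xi$ be the ONG
functional $\xi(x; \mathcal{U}_n) = (D(x; \mathcal{U}_n))^\alpha$. By Lemmas 1 and 2, $\xi$ is homogeneous of order $\alpha$,
stabilizing on $\mathcal{H}_1$ with limit $\xi_\infty(\mathcal{H}_1) = (D(\mathbf{0}; \mathcal{H}_1))^\alpha$, and satisfies the moment condition
(1) for some $p > 1$, provided $\alpha < d$. So Theorem 1 with $q = 1$ implies

$$n^{(\alpha/d)-1} \mathcal{O}^{d,\alpha}(\mathcal{U}_n) = n^{-1} \sum_{x \in \mathcal{U}_n} (D(n^{1/d} x; n^{1/d} \mathcal{U}_n))^\alpha \xrightarrow{L^1} E[\xi_\infty(\mathcal{H}_1)].$$

For $u \in (0, 1)$ the points of $\mathcal{H}_1$ with lower mark than $u$ form a homogeneous Poisson point
process of intensity $u$ (denoted $\mathcal{H}_u$), so by conditioning on the mark of the point at $\mathbf{0}$,

$$E[\xi_\infty(\mathcal{H}_1)] = \int_0^1 E[(d_1(\mathbf{0}; \mathcal{H}_u))^\alpha] \, du = \int_0^1 u^{-\alpha/d} E[(d_1(\mathbf{0}; \mathcal{H}_1))^\alpha] \, du = \frac{d}{d - \alpha} C(d, \alpha, 1),$$

since we saw in the proof of Theorem 2 that $E[(d_1(\mathbf{0}; \mathcal{H}_1))^\alpha] = C(d, \alpha, 1)$. □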
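The middle equality in the last display rests on a scaling step that is left implicit. A sketch of it, assuming the standard fact that dilating a unit-intensity homogeneous Poisson process by the factor $u^{-1/d}$ produces a homogeneous Poisson process of intensity $u$ (here $\mathcal{H}_u$ denotes such a process):

```latex
\mathcal{H}_u \overset{d}{=} u^{-1/d}\,\mathcal{H}_1
\quad\Longrightarrow\quad
E\big[(d_1(\mathbf{0};\mathcal{H}_u))^{\alpha}\big]
  = u^{-\alpha/d}\, E\big[(d_1(\mathbf{0};\mathcal{H}_1))^{\alpha}\big],
\qquad
\int_0^1 u^{-\alpha/d}\,du = \frac{d}{d-\alpha}
\quad (0 \le \alpha < d).
```

The integral converges precisely because $\alpha < d$, which is where the restriction $\alpha \in [0, d)$ in Theorem 4 enters the evaluation of the constant.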
3.3 Proof of Theorem 5

In applying Theorem 1 to the MDSF, we take $f$ to be the indicator of $(0, 1)^2$. We take
$\xi(x; \mathcal{X})$ to be $(d(x; \mathcal{X}))^\alpha$, where $d(x; \mathcal{X})$ is the distance from point $x$ to its directed nearest
neighbour in $\mathcal{X} \setminus \{x\}$, if such a point exists, or zero otherwise, i.e.

$$\xi(x; \mathcal{X}) = (d(x; \mathcal{X}))^\alpha \text{ with } d(x; \mathcal{X}) := \min\{\|x - y\| : y \in \mathcal{X} \setminus \{x\}, \; y \preccurlyeq^{\theta,\phi} x\} \tag{21}$$

with the convention that $\min \emptyset = 0$.

We consider the random point set $\mathcal{U}_n$, the binomial point process consisting of $n$
independent uniformly distributed points on $(0, 1)^2$. However, as remarked before the
statement of Theorem 5, the result (13) carries through (with virtually the same proof)
to more general point sets $\mathcal{X}_n$.

We need to show that $\xi$ given by (21) satisfies the conditions of Theorem 1. As before,
$\mathcal{H}_1$ denotes a homogeneous Poisson process on $\mathbf{R}^2$.

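Lemma 3 The MDSF functional $\xi$ given by (21) almost surely stabilizes on $\mathcal{H}_1$ with limit
$\xi_\infty(\mathcal{H}_1) = (d(\mathbf{0}; \mathcal{H}_1))^\alpha$.

Proof. Set $R := d(\mathbf{0}; \mathcal{H}_1)$. Since $\phi > 0$ we have $0 < R < \infty$ almost surely. But then for
any $\ell > R$, we have $\xi(\mathbf{0}; (\mathcal{H}_1 \cap B(\mathbf{0}; \ell)) \cup \mathcal{A}) = R^\alpha$, for any finite $\mathcal{A} \subset \mathbf{R}^d \setminus B(\mathbf{0}; \ell)$. Thus
$\xi$ stabilizes on $\mathcal{H}_1$ with limit $\xi_\infty(\mathcal{H}_1) = R^\alpha$. □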

We now give a geometrical lemma. For $B \subset \mathbb{R}^2$ with $B$ bounded, and for $x \in B$, write
$\mathrm{dist}(x, \partial B)$ for $\sup\{r : B(x; r) \subseteq B\}$, and for $s > 0$, define the region

$$A_{\theta,\phi}(x, s; B) := B(x; s) \cap B \cap C_{\theta,\phi}(x). \tag{22}$$

Lemma 4 Let $B$ be a convex bounded set in $\mathbb{R}^2$, and let $x \in B$. If $A_{\theta,\phi}(x, s; B) \cap \partial B(x; s) \neq \emptyset$, and $s > \mathrm{dist}(x, \partial B)$, then

$$|A_{\theta,\phi}(x, s; B)| \geq s \sin(\phi/2)\, \mathrm{dist}(x, \partial B)/2.$$

Proof. The condition $A_{\theta,\phi}(x, s; B) \cap \partial B(x; s) \neq \emptyset$ says that there exists $y \in B \cap C_{\theta,\phi}(x)$
with $\|y - x\| = s$. The line segment $xy$ is contained in the cone $C_{\theta,\phi}(x)$; take a half-line $h$
starting from $x$, at an angle $\phi/2$ to the line segment $xy$ and such that $h$ is also contained
in $C_{\theta,\phi}(x)$. Let $z$ be the point in $h$ at a distance $\mathrm{dist}(x, \partial B)$ from $x$. Then the interior of
the triangle $xyz$ is entirely contained in $A_{\theta,\phi}(x, s; B)$, and has area $s \sin(\phi/2)\, \mathrm{dist}(x, \partial B)/2$.
□

Lemma 5 Suppose $\alpha > 0$. Then the MDSF functional $\xi$ given by (21) satisfies the
moments condition of Theorem 1 for any $p \leq 2/\alpha$.

Proof. Setting $R_n := (0, n^{1/2})^2$, conditioning on the position of $U_1$, we have

$$E[(\xi(n^{1/2} U_1; n^{1/2} \mathcal{U}_n))^p] = n^{-1} \int_{R_n} E[(\xi(x; n^{1/2} \mathcal{U}_{n-1}))^p]\, dx. \tag{23}$$



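For $x \in R_n$ set $m(x) := \mathrm{dist}(x, \partial R_n)$. We divide $R_n$ into three regions

$$R_n(1) := \{x \in R_n : m(x) \leq n^{-1/2}\}; \quad R_n(2) := \{x \in R_n : m(x) > 1\};$$
$$R_n(3) := \{x \in R_n : n^{-1/2} < m(x) \leq 1\}.$$

For all $x \in R_n$, we have $\xi(x; n^{1/2}\mathcal{U}_{n-1}) \leq (2n)^{\alpha/2}$, and hence, since $R_n(1)$ has area at most
4, we can bound the contribution to (23) from $x \in R_n(1)$ by

$$n^{-1} \int_{R_n(1)} E[(\xi(x; n^{1/2}\mathcal{U}_{n-1}))^p]\, dx \leq 4 n^{-1} (2n)^{p\alpha/2} = 2^{2 + p\alpha/2}\, n^{(p\alpha - 2)/2}, \tag{24}$$

which is bounded if $p\alpha \leq 2$. Now, for $x \in R_n$, with $A_{\theta,\phi}$ defined at (22), we have

$$P[d(x; n^{1/2}\mathcal{U}_{n-1}) > s] \leq P[n^{1/2}\mathcal{U}_{n-1} \cap A_{\theta,\phi}(x, s; R_n) = \emptyset] = \left(1 - \frac{|A_{\theta,\phi}(x, s; R_n)|}{n}\right)^{n-1} \leq \exp(1 - |A_{\theta,\phi}(x, s; R_n)|), \tag{25}$$

since $|A_{\theta,\phi}(x, s; R_n)| \leq n$. For $x \in R_n$ and $s > m(x)$, by Lemma 4 we have

$$|A_{\theta,\phi}(x, s; R_n)| \geq s \sin(\phi/2)\, m(x)/2 \quad \text{if } A_{\theta,\phi}(x, s; R_n) \cap \partial B(x; s) \neq \emptyset,$$

and also

$$P[d(x; n^{1/2}\mathcal{U}_{n-1}) > s] = 0 \quad \text{if } A_{\theta,\phi}(x, s; R_n) \cap \partial B(x; s) = \emptyset.$$

For $s \leq m(x)$, we have that $|A_{\theta,\phi}(x, s; R_n)| = s^2 (\phi/2) \geq s^2 \sin(\phi/2)$. Combining these
observations and (25), we obtain for all $x \in R_n$ and $s > 0$ that

$$P[d(x; n^{1/2}\mathcal{U}_{n-1}) > s] \leq \exp\big(1 - (s/2) \min(s, m(x)) \sin(\phi/2)\big), \quad x \in R_n.$$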

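Setting $c = (1/2) \sin(\phi/2)$, we therefore have for $x \in R_n$ that

$$E[(\xi(x; n^{1/2}\mathcal{U}_{n-1}))^p] = \int_0^\infty P[d(x; n^{1/2}\mathcal{U}_{n-1}) > r^{1/(\alpha p)}]\, dr$$
$$\leq \int_0^{m(x)^{\alpha p}} \exp\big(1 - c r^{2/(\alpha p)}\big)\, dr + \int_{m(x)^{\alpha p}}^\infty \exp\big(1 - c\, m(x)\, r^{1/(\alpha p)}\big)\, dr$$
$$= O(1) + \alpha p\, m(x)^{-\alpha p} \int_{m(x)^2}^\infty e^{1 - cu} u^{\alpha p - 1}\, du = O(1) + O(m(x)^{-\alpha p}). \tag{26}$$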
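The second integral in (26) comes from a change of variables, sketched here under the substitution $u = m(x)\, r^{1/(\alpha p)}$; the final bound by a Gamma function is one way (not necessarily the author's) to see that the remaining integral is $O(1)$ uniformly in $x$:

```latex
\begin{aligned}
u = m(x)\, r^{1/(\alpha p)}
  &\;\Longrightarrow\;
  r = \left(\frac{u}{m(x)}\right)^{\alpha p},
  \qquad
  dr = \alpha p\, m(x)^{-\alpha p}\, u^{\alpha p - 1}\, du,
  \qquad
  r = m(x)^{\alpha p} \mapsto u = m(x)^2, \\
\int_{m(x)^2}^{\infty} e^{1 - cu}\, u^{\alpha p - 1}\, du
  &\;\leq\; e \int_0^{\infty} e^{-cu}\, u^{\alpha p - 1}\, du
  \;=\; e\, \Gamma(\alpha p)\, c^{-\alpha p} \;<\; \infty .
\end{aligned}
```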

For $x \in R_n(2)$, this bound is $O(1)$, and the area of $R_n(2)$ is less than $n$, so that the
contribution to (23) from $R_n(2)$ satisfies

$$\limsup_{n \to \infty}\; n^{-1} \int_{R_n(2)} E[(\xi(x; n^{1/2}\mathcal{U}_{n-1}))^p]\, dx < \infty. \tag{27}$$

Finally, by (26), there is a constant $C \in (0, \infty)$ such that the contribution to (23) from
$R_n(3)$ satisfies

$$n^{-1} \int_{R_n(3)} E[(\xi(x; n^{1/2}\mathcal{U}_{n-1}))^p]\, dx \leq C n^{-1/2} \int_{n^{-1/2}}^{1} y^{-\alpha p}\, dy \leq C n^{-1/2} \max\{\log n,\; n^{(\alpha p - 1)/2}\},$$

which is bounded if $\alpha p \leq 2$. Combined with the bounds in (24) and (27), this shows that
the expression (23) is uniformly bounded, provided $\alpha p \leq 2$. □

For $k \in \mathbb{N}$, and for $a < b$ and $c < d$, let $\mathcal{U}_{k,(a,b] \times (c,d]}$ denote the point process consisting of
$k$ independent random vectors uniformly distributed on the rectangle $(a, b] \times (c, d]$. Before
proceeding further, we recall that if $M(\mathcal{X})$ denotes the number of minimal elements, under
the partial order $\preccurlyeq^*$, of a point set $\mathcal{X} \subset \mathbb{R}^2$, then

$$E[M(\mathcal{U}_{k,(a,b] \times (c,d]})] = E[M(\mathcal{U}_k)] = 1 + (1/2) + \cdots + (1/k) \leq 1 + \log k. \tag{28}$$

The first equality in (28) comes from some obvious scaling which shows that the distribution
of $M(\mathcal{U}_{k,(a,b] \times (c,d]})$ does not depend on $a, b, c, d$. For the second equality in (28), see
e.g. [3].
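The identity (28) is easy to probe by simulation. The sketch below assumes that the partial order in question is the coordinatewise order on the plane (a point is minimal when no other point is at most it in both coordinates); that reading, and the function names `count_minimal`, `harmonic` and `mean_minimal`, are assumptions of this example rather than notation from the text.

```python
import random

def count_minimal(points):
    """Number of points with no other point <= them in both coordinates."""
    count = 0
    for i, (xi, yi) in enumerate(points):
        dominated = any(
            j != i and xj <= xi and yj <= yi
            for j, (xj, yj) in enumerate(points)
        )
        if not dominated:
            count += 1
    return count

def harmonic(k):
    """The k-th harmonic number 1 + 1/2 + ... + 1/k."""
    return sum(1.0 / j for j in range(1, k + 1))

def mean_minimal(k, trials, seed=0):
    """Monte Carlo estimate of E[M(U_k)] for k uniform points on the unit square."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        pts = [(rng.random(), rng.random()) for _ in range(k)]
        total += count_minimal(pts)
    return total / trials
```

With a few thousand trials, `mean_minimal(10, 4000)` should land close to `harmonic(10)`, which is itself below the bound 1 + log 10 appearing in (28).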

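Proof of Theorem 5. Suppose $\alpha \in (0,2)$, and set $f$ to be the indicator of $(0,1)^2$. By
Lemmas 3 and 5 the functional $\xi$, given at (21), satisfies the conditions of Theorem 1 with
$p = 2/\alpha$ and $q = 1$. So by Theorem 1 we have

$$n^{(\alpha/2)-1} \mathcal{M}^{\alpha}(\mathcal{U}_n) = n^{-1} \sum_{x \in \mathcal{U}_n} \xi(n^{1/2} x; n^{1/2} \mathcal{U}_n) \xrightarrow{L^1} E[\xi_\infty(\mathcal{H}_1)]. \tag{29}$$

Since the disk sector $C_{\theta,\phi}(x) \cap B(x; r)$ has area $(\phi/2) r^2$, by Lemma 3 we have

$$P[\xi_\infty(\mathcal{H}_1) > s] = P[\mathcal{H}_1 \cap C_{\theta,\phi}(0) \cap B(0; s^{1/\alpha}) = \emptyset] = \exp(-(\phi/2) s^{2/\alpha}).$$

Hence the limit in (29) is, using (15),

$$E[\xi_\infty(\mathcal{H}_1)] = \int_0^\infty P[\xi_\infty(\mathcal{H}_1) > s]\, ds = \alpha\, 2^{(\alpha-2)/2}\, \phi^{-\alpha/2}\, \Gamma(\alpha/2),$$

and this gives us (13). Finally, in the case where $\preccurlyeq^{\theta,\phi}$ is $\preccurlyeq^*$, (13) remains true when $\mathcal{U}_n$ is
replaced by $\mathcal{U}_n^0$, since

$$E\big[n^{(\alpha/2)-1} |\mathcal{M}^{\alpha}(\mathcal{U}_n^0) - \mathcal{M}^{\alpha}(\mathcal{U}_n)|\big] \leq 2^{\alpha/2}\, n^{(\alpha/2)-1}\, E[M(\mathcal{U}_n)], \tag{30}$$

where $M(\mathcal{U}_n)$ denotes the number of $\preccurlyeq^*$-minimal elements of $\mathcal{U}_n$. By (28), $E[M(\mathcal{U}_n)] \leq
1 + \log n$, and hence the right-hand side of (30) tends to 0 as $n \to \infty$ for $\alpha < 2$. □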
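As a numerical sanity check on the limiting constant in the proof of Theorem 5, the sketch below compares the closed form $\alpha\, 2^{(\alpha-2)/2} \phi^{-\alpha/2} \Gamma(\alpha/2)$ with a direct midpoint-rule evaluation of $\int_0^\infty \exp(-(\phi/2) s^{2/\alpha})\, ds$. The function names, the truncation point `upper` and the step count are choices made for this illustration only.

```python
import math

def mdsf_limit_closed_form(alpha, phi):
    """alpha * 2^((alpha-2)/2) * phi^(-alpha/2) * Gamma(alpha/2)."""
    return (alpha * 2.0 ** ((alpha - 2.0) / 2.0)
            * phi ** (-alpha / 2.0) * math.gamma(alpha / 2.0))

def mdsf_limit_numeric(alpha, phi, upper=60.0, steps=200_000):
    """Midpoint rule for the tail integral, truncated at `upper`.

    The truncation is an approximation; it is adequate when the
    integrand is negligible beyond `upper`.
    """
    h = upper / steps
    total = 0.0
    for i in range(steps):
        s = (i + 0.5) * h
        total += math.exp(-(phi / 2.0) * s ** (2.0 / alpha))
    return total * h
```

In the case $\alpha = 1$, $\phi = \pi/2$, the closed form simplifies to exactly 1, which gives a convenient check of both functions.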

3.4 Proof of Theorem 6

Proof of Theorem 6. In applying Theorem 1 to the Gabriel graph, we take $\xi(x; \mathcal{X}_n)$
to be one half of the total $\alpha$ power-weighted length of all the edges incident to $x$ in the
Gabriel graph on $\mathcal{X}_n \cup \{x\}$; the factor of one half prevents double counting. As stated
in [26] (Section 2.3(e)), $\xi$ is translation invariant, homogeneous of order $\alpha$ and stabilizing
on $\mathcal{H}_1$, and if the function $f$ satisfies condition (C1) then the moments condition of
Theorem 1 is satisfied for some $p > 2$. So by Theorem 1 with $q = 2$,

$$n^{(\alpha/d)-1} \mathcal{G}^{d,\alpha}(\mathcal{X}_n) = n^{-1} \sum_{x \in \mathcal{X}_n} \xi(n^{1/d} x; n^{1/d} \mathcal{X}_n) \xrightarrow{L^2} E[\xi_\infty(\mathcal{H}_1)] \int_{\mathbb{R}^d} f(x)^{(d-\alpha)/d}\, dx. \tag{31}$$

We need to evaluate the expectation on the right-hand side of (31). The net contribution
from a vertex at 0 to the total weight of the Gabriel graph on $\mathcal{H}_1$ is

$$\frac{1}{2} \sum_{k=1}^{\infty} (d_k(0; \mathcal{H}_1))^\alpha\, \mathbf{1}_{E_k}, \tag{32}$$

where the factor of one half ensures that edges are not counted twice, $d_k(0; \mathcal{H}_1)$ is the
distance from 0 to its $k$-th nearest neighbour in $\mathcal{H}_1$, and $E_k$ denotes the event that 0 and
its $k$-th nearest neighbour in $\mathcal{H}_1$ are joined by an edge in the Gabriel graph.

Given that the point $x \in \mathcal{H}_1$ is the $k$-th nearest neighbour of 0, an edge between $x$
and 0 exists in the Gabriel graph iff the ball with 0 and $x$ diametrically opposed contains
none of the other $k - 1$ points of $\mathcal{H}_1$ that are uniformly distributed in the ball with centre 0
and radius $\|x\|$. The smaller ball has radius $\|x\|/2$ and lies inside the larger one, so its share of the volume is $2^{-d}$. Thus for $k \in \mathbb{N}$,

$$P[E_k] = (1 - 2^{-d})^{k-1}. \tag{33}$$

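So from (32) and (33) we have

$$E[\xi_\infty(\mathcal{H}_1)] = \frac{1}{2} \sum_{k=1}^{\infty} (1 - 2^{-d})^{k-1} E[(d_k(0; \mathcal{H}_1))^\alpha] = \frac{1}{2}\, v_d^{-\alpha/d} \sum_{k=1}^{\infty} (1 - 2^{-d})^{k-1}\, \frac{\Gamma(k + (\alpha/d))}{\Gamma(k)},$$

by (17). But by properties of Gauss hypergeometric series (see 15.1.1 and 15.1.8 of [1])

$$\sum_{k=1}^{\infty} (1 - 2^{-d})^{k-1}\, \frac{\Gamma(k + (\alpha/d))}{\Gamma(k)} = \Gamma(1 + (\alpha/d))\, 2^{d+\alpha}.$$

Then with (31) the proof is complete. □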
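The hypergeometric summation used at the end of the proof of Theorem 6 can be checked numerically. The sketch below sums the series $\sum_{k \geq 1} (1 - 2^{-d})^{k-1}\, \Gamma(k + \alpha/d)/\Gamma(k)$ directly and compares it with $\Gamma(1 + \alpha/d)\, 2^{d+\alpha}$. It also evaluates the resulting constant $v_d^{-\alpha/d}\, 2^{d+\alpha-1}\, \Gamma(1 + \alpha/d)$, under the assumption that $v_d$ denotes the volume of the unit ball in $\mathbb{R}^d$. The function names and the number of terms are illustrative choices.

```python
import math

def gabriel_series(d, alpha, terms=5000):
    """Partial sum of sum_{k>=1} z^(k-1) Gamma(k + a) / Gamma(k),
    with z = 1 - 2^(-d) and a = alpha / d.

    Terms are computed in log space to avoid overflow. The fixed number
    of terms is ample for small d; larger d needs more terms.
    """
    z = 1.0 - 2.0 ** (-d)
    a = alpha / d
    total = 0.0
    for k in range(1, terms + 1):
        log_term = (k - 1) * math.log(z) + math.lgamma(k + a) - math.lgamma(k)
        total += math.exp(log_term)
    return total

def gabriel_series_closed_form(d, alpha):
    """Gamma(1 + alpha/d) * 2^(d + alpha)."""
    return math.gamma(1.0 + alpha / d) * 2.0 ** (d + alpha)

def gabriel_limit_constant(d, alpha):
    """v_d^(-alpha/d) * 2^(d+alpha-1) * Gamma(1 + alpha/d).

    Assumes v_d is the volume of the unit d-ball, pi^(d/2) / Gamma(1 + d/2).
    """
    v_d = math.pi ** (d / 2.0) / math.gamma(1.0 + d / 2.0)
    return v_d ** (-alpha / d) * 0.5 * gabriel_series_closed_form(d, alpha)
```

Under these assumptions, the planar case with $\alpha = 1$ gives a constant of exactly 2, and $\alpha = 0$ gives $2^{d-1}$, which is half the mean degree of a vertex.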